Proposed Model for Context Topic Identification of English and Hindi News Article Through LDA Approach with NLP Technique
Abstract
According to the survey, India has the world's second-largest newspaper market, with more than 100 K newspaper outlets, a circulation of approximately 240 million, and 1300 million subscribers or readers. Topic modeling work is increasing day by day: researchers have published multiple topic modeling papers and have implemented them in different areas such as software engineering, political science, and medicine. LDA is used in this research because it has been applied successfully to classification and measures the probability of a text based on a bag-of-words scheme, without considering word order. LDA is a common topic modeling algorithm with an excellent implementation in the Gensim Python package. The challenge, however, is how to extract good-quality topics that are simple, well separated, and meaningful. The purpose of this research is to find the main topic of news articles in the same category that are written in two languages (Hindi and English), and then to classify these articles across the two languages using a similarity measurement. In this research, the LDA corpus is constructed with bigrams. To achieve this goal, we first build a headline and link extractor to scrape the top news headlines from Google News feeds for both English and Hindi (the Google News feed collects news stories that have appeared on websites over the last 30 days and is already accessible in 35 languages), and then analyze which news headlines are similar.
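The extraction and comparison steps described in the abstract can be illustrated with a minimal, standard-library-only sketch. It is not the authors' implementation. The feed snippet `SAMPLE_FEED`, the example URLs, and the function names are hypothetical. The similarity measure is a plain cosine over bag-of-words counts with bigrams, used here as a stand-in for comparing LDA topic distributions. The tokenizer is a simple regular expression that assumes English text only.

```python
import math
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical RSS 2.0 snippet standing in for a downloaded news feed.
SAMPLE_FEED = """<rss version="2.0">
  <channel>
    <title>Sample feed</title>
    <item>
      <title>Heavy rain floods city streets</title>
      <link>https://example.com/a</link>
    </item>
    <item>
      <title>City streets flooded after heavy rain</title>
      <link>https://example.com/b</link>
    </item>
    <item>
      <title>Cricket team wins final match</title>
      <link>https://example.com/c</link>
    </item>
  </channel>
</rss>"""


def extract_headlines(feed_xml):
    """Return (headline, link) pairs for every <item> in an RSS 2.0 document."""
    root = ET.fromstring(feed_xml)
    return [
        (item.findtext("title", default=""), item.findtext("link", default=""))
        for item in root.iter("item")
    ]


def bag_of_words_with_bigrams(text):
    """Count unigrams and adjacent-word bigrams; word order is otherwise ignored."""
    tokens = re.findall(r"\w+", text.lower())
    bigrams = ["_".join(pair) for pair in zip(tokens, tokens[1:])]
    return Counter(tokens + bigrams)


def cosine_similarity(a, b):
    """Cosine similarity between two Counter feature vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


headlines = extract_headlines(SAMPLE_FEED)
vectors = [bag_of_words_with_bigrams(title) for title, _ in headlines]

# Compare the first headline with the other two.
sim_rain = cosine_similarity(vectors[0], vectors[1])
sim_cricket = cosine_similarity(vectors[0], vectors[2])
print(f"rain vs. rain:    {sim_rain:.3f}")
print(f"rain vs. cricket: {sim_cricket:.3f}")
```

In this sketch the two flood headlines share several unigrams and the bigrams `heavy_rain` and `city_streets`, so they score higher than the unrelated cricket headline. Comparing Hindi and English headlines would additionally require a shared representation (for example, translation or a bilingual topic space), which the sketch does not attempt.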
Journal
Journal title: Journal of The Institution of Engineers (India): Series B
Year: 2021
ISSN: 2250-2106, 2250-2114
DOI: https://doi.org/10.1007/s40031-021-00655-w